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between reader and text • Following a brief ihtroductioh , the paper 
reviews current readability formula research, revealihg two strohg 
trends- — one toward greater itatistical accuracy ahd more 
compj ehensive measurement of text variables, the other toward greater 
convenience for the average computationally unskilled user^ Notihg 
the conflict inherent in these trends, the paper then looks at 
alternative approaches to measuring readability drawn from studies of 
the comprehensibility of jury instructions^ of government forms ahd 
regulations, of stories in children *s reading lessons, of children *s 
literature selections, of selections in languages other than Ehglish 
and from languages without previous traditions of writing. The paper 
then reviews research on language processing, contrasting the 
properties of readability formulas with what is currently known about 
how language is acquired and used* The paper concludes that research 
beyond formulas, investigating w^ language and analyzing the 

psychology and linguistics of language processing, will provide 
insight into the pressing question of how to ii^rove text 
readability. (Twenty-four references are included.) (FL) 
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Abstract 

This paper discusses current research on readability. Much 
research has concentrated ori using formulas to predict how 
readable a text is and has dealt primarily with statistics and 
predictions. This research has not looked at whether the 
formulas really work. What is most needed is the answer to the 
question of how the text can be made better, enabling students to 
read with less difficulty. Some recent research illustrating 
possible alternative approaches to dealing with the complexity of 
tFXts and language for children and adults in the United States, 
as well as in non-English speaking countries, is discussed and 
surveyed. Research that goes beyond formulas, by investigating 
written language and analyzing the psychology and linguistics of 
language processing, will provide insight into the pressing 
question of how to improve text readability. 
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Readability- -The Situation Today 

This paper sarveys current research on readability, taking 
the term in a much more general sense than it is usually taken. 
There are wider and narrower senses in which the term readability 
can be taken. In the narrower sense, it refers to the 
development and use of readability formulas arid related objective 
methods which use a small number of measures of variables such as 
average number of words, syllables, etc., in a senterice or text. 
For a series of excellent surveys and discussion of work on 
readability formulas, including their successes and failures, 
there is rio better source than the book and articles by G. Klare 
(1963, 1974-75, 1984). But readability formulas were first 
created to answer a number of very broad questions- -what makes a 
text difficult to read? What will predict that readers with 
particular levels of skills can read a particular text? (Here i 
am using text in its technical sense as sequence of connect ed 
sentences . ) 

These questions remairi largely unanswered even today, if we 
think in terms of a model of reading comprehension applied to 
linguistic features of the text. There has been much interesting 
and productive research on features of texts, such as general 
content and overall organization, in relation to readers' 
knowledge and ability to make sense of the information. But very 
little is understood about how the structure of sentences and the 
nature of the words used might affect comprehension of a text. 

The successes of formulas have been statistical. For a 
large number of readers with varying abilities, and for large 
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numbers of texts with varying sentence and word lengths, formulas 
can be used to make fairly successful predictions. But for more 
specific cases, they become less and less sensitive to special 
features of texts and readers. One particular question is often 
asked: Can this studenr, or ^this group of people, read this 
text? If not, why not? What can bis done to improve the chances 
that certain readers will comprehend a certain text? 

In addition, there is a problem of general theoretical 
interest. Readability formulas measure averages for length of 
sentence, and length or complexity or familiarity of words, which 
can vary in different parts of a longer text. These measures are 
supposed to reflect complexity of language which, iri some creates 
barriers to comprehension. The nature of the barrier, or at 
least one tyipe of obstacle to comprehension, is plausibly 
described as some sort of overload on the ability of the reader 
to process a certain quantity of linguistic information in a 
single short interval. But we know very little about what is 
affected and how. 

Discussions of readability can be understood both as 
concerned with readability formulas , with the specific issues 
appropriate to these statistically-based, objective predictions 
or, on the other hand, as a set of more general issues, many of 
which are completely independent of readability formulas. The 
central issue is what features of a text contribute to difficulty 
in comprehending its content? The question of difficulty may 
include linguistic variables, such as sentence structure and 
complexity of words or the information conveyed by the words. 
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But it also involves the abilities of the reader, as well as the 
reader's background knowledge and perception of the situation in 
which reading a particular text is taking place. 

A great deal of research has been done on readability in the 
first sensiB, and is still going on in very much the same way that 
it has been sincis the formulas came into use. Not so much has 
been done in psychology, education, or linguistics to provide 
answers within a rigorous model of how language is processed and 
comprehended in various situations. In this paper, various kinds 
of research which are being done and which promise more 
satisfactory answers are surveyed. I will propose that only 
technical refinements can be made in research on readability 
formulas, and without research that focusses on the fundamental 
questions of how language is read and understood, we will not 
make much progress in understanding readability or in more 
effectively matching texts and readers. 

Current Research 
The two features of readability formulas which have made 
them useful and constitute their appeal can be summarized as 
follows . 

(1) They measure features of language in an objective way, 
with statistical accuracy in their predictions of 
levels of comprehension; and 

(2) As a sampling procedure, taking average values for 
small parts of larger texts, they reduce the task of 
assessing difficulty, and the caiculations can be done 
without special training or equipment. 
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For the moment we ignore potential challenges to these 
assertions. Much current research, as noted in Klare (1984) has 
been devoted to these two issues, concentrating on the 
statistical features of formulas. Norms have been recalculated 
for the McCall-Crabbs reading passages which serve as the 
criterion for the predictions of formulas. Certain formulas have 
been revised to reflect the performance of contemporary student 
populations, and others have been created to make predictions for 
adults reading technical materials. It is likely that general 
formulas will continue to be adapted for adult readers and rion- 
school materials. One of the strongest current demands placed on 
formulas is for predictions for adult readers, especially those 
with poor reading skills, who must read and comprehend technical 
or other demanding material. 

Research also continues to be done on the measures or 
predictors of readability, the features of the language in a text 
for which objective calculations are made. Because of the 
growing use of computers in finding and testing statistical 
correlations, and in integrating enormous amounts of information, 
it is possible to explore in much greater detail than before all 
the possible ways that readability levels can be calculated, and 
to find more and more specific features of text (letters per 
word, number of coordinating conjunctions, number of anaphoric 
words, etc.) which serve as predictors of difficulty. It is also 
possible to avoid a problem of sampling by taking many more 
samples of text at regular intervals, or even to calculate 
formula values for entire texts. 
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The use of large amounts of data with the help of computers 
has helped to overcome some of the criticisms which have been 
made in the last few years, that older formulas were out of date, 
and that word lists of familiar words and the McCall-Crabbs 
reading passages did not necessarily reflect reading skills today 
of the student popalation. The ability of computers to deal 
efficiently with large amounts of data has also overcome some of 
the objections to formulas based on sampling of passages from 
texts. But formulas in themselves often don't specify a sampling 
procedure which contributes to accurate predictions, and they 
certainly don't guarantee that a correct sampling is performed. 

The use of computers helps to ensure that readability 
formulas make accurate statistical predictions to the extent that 
they are capable of doing so. But this trend, with all its 
advantages, introduces a certain contradiction. If computation 
of readability levels requires the use of computers and the 
skills necessary to apply computers for this purpose, then the 
use of readability formulas is no longer in the hands of the 
average user, though this situation will probably change a little 
with the growing availability and use of microcomputers. Hence, 
very detailed and accurate use is not always within the reach of 
the ordinary user of the formulas. 

Another striking trend in research has concentrated on 
making existing formulas easier to apply than before. In some 
cases, this involves more efficient hand-counting of the 
linguistic variables; in others, it means more efficient 
calculation of the factors in the formulas --this is facilitated 
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by the recent availability of small calculators as well. The 
Raygor Readability Estimator (1977) is a splendid exampls of both 
aspects of simplification. Instead of counting all of the 
syllables in a 100-word sample, one counts only those words with 
more than six letters, a number of letters which can be 
determined by eye: ir. most cases, instead of having to be 
counted. The number of words of six letters or more is entered 
on a slide-rule like calculator, and the grade level is then read 
off a scale in relation to the number of sentence breaks in the 
sample. The small size and compactness of the calculator and the 
simplification of the counting procedure in fact make it very 
easy to use it in conjunction with a text of any length. The 
calculator itself is large enough to contain a printed warning 
about what kind of sampling procedure to use, what kind of text 
to apply it to, and specifically which kinds of text not to apply 
it to, and finally what degree of accuracy to expect. If the 
user follows these instructions, then formulas will be applied 
with a reasonable sampling to the right kind of text, and the 
result will be a prediction of an approximate readability level. 

There :ire two strong trends, then, in current readability 
research. One is towards greater statistical accuracy and more 
comprehensive measurement of text variables, achieved by working 
with large amounts of data; the other is toward greater 
convenience for the average computationally unskilled user, with 
some loss in fineness of detail or statistical accuracy. 
Clearly, these trends are in conflict, and one might ask if one 
and the same formula can really be asked to serve two such 
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differing purposes. One might also speculate that some different 
directions might be taken in the two areas of greater accuracy 
and user convenience. 

Some of the research in a new direction might deal with new 
aspects of texts. As Klare notes (1984, p. 085), foriuias are 
not sensitive to mor.t of the important features which seem to 
affect how well £ reader will comprehend the text. These include 
content, style, format arid organization, and, of these, formulas 
measure only style. It might be argued that they do not ieasure 
style either, except in the narrowest sense of sentence length 
and word complexity. Other features of style which are of a more 
•literary quality include the use of sentence structure and word 
choice to convey aspects of meaning in addition to the literal 
content of the text. But in any case, formulas are not sensitive 
to the motivations of the reader, the purpose for reading, or the 
amount of background knowledge which the reader already has about 
the subject matter in the text. 

It might be possible to reduce these factors to formula- like 
variables and to do statistical correlations for them, as with 
the other variables used. But, many of the linguistic factors 
are both difficult to identify without careful analysis of the 
text and also appear infrequently. Other factors, such as text 
organization, are difficult to reduce to objectively definable 
units, particularly since we know very little about hew 
discourses are really structured. Finally, we know very little 
about how factors such as tey.t organization and syntactic 
structures interrelate, if in fact they do. It appears that the 
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extension of formulas to cover other variables wdUld be useful 
and effective only if we had some well-founded hypotheses about 
how they affect comprehension. Statistical correlations with 
comprehension might be obtained by trial and error, but even if 
the results were interesting it is unlikely that they would be as 
informative about the process of comprehension as direct 
observation. 

Current refinement > of readability formulas may make the 
approach as effective as it will ever be for predictions about 
large aggregates of texts and readers. But we will not begin to 
understand v/hat makes a text complex and under what 
circumstances, unless we look directly at aspects of texts, 
readers and situations. To do this, we need to be concerned with 
understanding in a more general way how language is comprehended, 
and how skills are acquired in interpreting linguistic 
structures. In other words, the real questions of readability 
are questions of educational and cognitive psychology, 
linguistics and cognitive science. 

Current Approache s to Research on Readability Not Involving 
Formulas 

In this section, i will note some recent research which 
illustrate;5 possible alternative approaches to dealing with the 
com; lexity of texts and of the language in which ^hey are 
written. This survey is not meant to be exhaustive (for 
additional references and discussion, see Klare, 1984, p. 701ff). 
I want to include a diverse group of examples, even some which 
lie outside the topics usually discussed in connection with 
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formulas, in order to underline the fact that there is no single 
best approach to the great variety of problem?; of text 
difficulty. Rather, each specific situation needs to be 
approached in the terms appropriate to it- -what readers are 
involved, what their purpose in reading is, and the nature of the 
texts and language. Of course, it is hoped that when we have 
gained more understanding about how language and information are 
understood and remembered, then perhaps some unifying principles 
\vill emerge. 

The cases discussed below all involve the need to know what 
makes a text linguistically complex, and how to make it less so, 
or else how to match readers of different levels with texts 
within their ability. Two involve adult readers coping with 
technical materials: jury instructions and government 
regulations and forms. Another concerns the match of children's 
books with readers of the right age and level of ability, outside 
the context of school reading. Others are samples of projects 
being done in many societies, involving languages very much 
unlike English, perhaps with no trt.rlicion of writing, where 
school materials of appropriate levels must be chosen or written, 
in all of these cases, readability formulas are inappropriate or 
not useful, and other resources must be chosen from the ones 
which are available. 

J^^y ipstruo^tions . Robert and Veda Charrow (1979) studied 
how well adults comprehend the legal definitions given to members 
of juries, and compared the level of comprehension for the usual 
form in which the instructions are given with a form revised by 
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with poor comprehension. Jury instructions are definitions of 
principles of law, such as what constitutes contributory 
negligence . These standard definitions, composed by lawyers, are 
read to the members of the jury before they begin their 
deliberations. Their decision is to be related to these points 
of law- -that is, if the defendant is guilty of contributory 
negligence, in ^4.^ particular definition. There is both 
anecdotal and systematic evidence that most jurors, even those 
with education beyond high school, do not understand these 
definitions very well, though the more education a juror has, the 
better the instructions are understood. But clearly it is 
desirable that the average juror should be able to understand the 
principles which guide his or her decision. 

In the first part of the study, the sources of difficulty 
were located. A test of recall, which reflected comprehension, 
showed that difficulties of comprehension were associated with 
specific semantic and s3mtactic characteristics of the text. 
These included double negatives, parenthetical elements placed 
far away from the material they were related to, multiple 
subordinate clauses, and deleted elements. The revisions were 
made by substituting more explicit, less complex but equivalent 
sentence structures ^ in order to present the original content 
cl&arly. In general, the difficulty levels of the originals, as 
measured by readability formulas, did not change in the 
revisions . 
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jurors were asked to listen to the same jury instructions, half 
in their original form, and half in their revision. Because each 
group saw some original and jsdine revised instructions, it was 
possible to compare performance for the two forms of each 
instruction. By a significant amount, the revised forms were 
cbfliprehend-d better than the original. The increase in 
comprehension was about 40% over the level found in the original 
form. 

From the point of view of the real world problem of making 
sure that jurors are adequately informed about the decision they 
are asked to make, the changes reported by the Charrows are not 
enormous. In some cases, the original level of comprehension was 
25%, but the improvement reached only 42%; we would rather have 
all or nearly all the jurors understand the instructions 
completely. But these results are still very interesting and 
important for two reasons. First, the increases in comprehension 
were unrelated to readability formulc.s. The readability levels 
for the revisions would hot have predicted the observed gains in 
comprehension, and, in fact, did predict increases for four 
instructions for which no significant results were seen. Second, 
the revisions were guided by features not connected with 
readability formulas; the sentences were not shortened and the 
long words were not replaced by shorter or more frequent ones. 
The investigators attempted to diagnose the possible difficulties 
by looking at both the content and the form of the text. The 
increase in comprehension appears to be caused by changes in the 
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outward form of the text^ the clarifications in the syntax and 
organization, iri spite of the fact that the content, vhich was 
complex, remained the same. One of the most intcrestirg and 
useful features of Charrbw and Charrow (1979) is the detailed 
discussion of each instruction, its particular difficulties, and 
how they were resolved. 

G overnment regul ations and forgs. It is widely perceived 
that government forms and regulations are very difficult for lay 
people to read and understand correctly, particularly those with 
little education and no access to expert help. One trend in the 
movement toward simplification has been to apply readability- 
formulas, to shorten sentences and simplify words, though with no 
evidence that the predictive power of formulas extends to what 
are very special and fragmentary texts of this kind (see Holland 
1981). Another i and unfortunately less popular, trend involves 
making use of information from potential readers which can be 
Used to get insights into how people go about understanding 
unfamiliar and abstract texts of this kind and how revisions can 
make better use of the resources which readers bring. 

In a revision of Medicaid forms, Redish and others (reported 
iri Holland, 1981) found that tbe users of the form were not clear 
about the meaning of some of the words and phrases used. They 
noted, however^ that caseworkers who used the original difficult 
form had evolved ways of paraphrasing the difficult parts and of 
giving specific explanations for questions. Some of these 
explanations were incorporated into the revision. Flower, Hayes, 
and Swarts (1980) found that people attempting to read complex 
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and abstract material, such as government regulations, do not 
concentrate so much on deciphering the long or complex sentences 
and hard words. What they do, as a strategy for understanding, 
is to translate abstract statements into specific instances, a 
series of related events, or a scenario. In a scenario the 
actors have particular goals and react to specific circumstances. 
Information expressed in this form, as a sequence of related 
events with identifiable causes and effects, seems to be clearer 
than the equivalent information summarized in condensed and 
abstract terms. People may also typically not realize what 
connections there are between items in a form, since they are not 
familiar with forms and the purposes they are used for. The 
group who revised the Medicaid form tried to help the applicants 
see thac- the form represented a coherent whole, with relations 
among thtt questions, by presenting the form as a kind of letter. 
Most people do know what kind of text a letter is, and they 
expect there to be connections among the parts of a letter. 

Increases in comprehension are often found with reader-based 
revisions like these (cf. Holland, 1981), but in some cases there 
are no observable effects. Walmsley, Scott, and Lahrer (1981) 
compared original and revised forms of health-related documents 
which were read by elderly people who answered questions about 
the content. Some revisions were done to reduce the levels; of 
difficulty of the originals according to the formulas. For 
others, skilled writers corrected difficulties in the text. Only 
for the longest document were any differences found in the level 
of success in answering comprehension questions. The revision 
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made by skilled writers for this one document showed gains for 
both good arid poor readers. The revisions done according to 
formulas showed no gain in comprehension, and even some loss; 
Readers showed a preference for all four of the documents that 
were revised by skilled writers. So even if revisions done with 
the readers and the content as the primary factors produce a gain 
only up to 10% in comprehension, it might be worthwhile to do 
more of this. The results seem to make the task of reading this 
kind of material less onerous and less unpleasant. 

With adults who have to read rather specialized texcs, it 
might be expected that readability formulas lose a lot of their 
predictive power, since the statistical strength of formulas is 
in large aggregates of different texts and different levels of 
ability (Rodriguez & Hansen, 1975). One response to this is to 
evolve very specialized formulas for a particular class of 
readers and texts with particular content. But while this 
approach might restore some of the statistical predictive power 
of a formula, it remains a superficial way of treating texts and 
readers. Alternatively, one could devote time and effort to 
learnir;g how readers understand texts and what particular 
difficulties they encounter. A formula makes certain 
predictions, which may or may not hold in a specific instance, 
arid there is no way of finding out why a given reader did or did 
riot cope with a text. The studies just surveyed were done in 
order to define feature:. s of text which could be made easier to 
understand for the audience in question, and, in particular, to 
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find out what resources the readers could use even if they were 
not highly skilled at reading. 

Stories in Ghil^r^tl's rea di^ig lessons . The subject of the 
research discussed here is quite familiar in the context c 
readability formulas. Formulas are often applied to the stories 
in children's reading textbooks to determine their relative 
difficulty, and revisions are often made in the text of these 
stories to improve the readability levels assigned them (but see 
Green and Laff [to appear] for evidence of the effectiveness of 
revisions). Beck, McKeown, Omanson, and Pople (1984) compared 
two versions of two stories of approximately second- or third- 
grade levels to see how much of this story children were able to 
recall and how well they could answer comprehension questions. 
What is of particular interest here is how the revised versions 
of the stories were created. A close analysis of the two stories 
was made to find possible sources of difficulty in the original 
texts. ITie revised version involved changes in th^se features of 
the text^ changes wbich were designed to correct for the 
difficulties. 

Most of these possible sources of confusion stemmed from 
ways that the content of the story was expressed, either 
linguiistically or in what was expressed, rather than what was 
implied. The linguistic factors included unclear reference to 
things in the text, ambiguous reference to antecedents and 
inexplicit or ambiguous temporal and causal relations. Problems 
with content included distractions in the text caused by 
irrelevant details and unexpressed important details which were 
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meant to be inferred in the original. Note that these factors 
are ones which a skilled writer or editor would pick out as flaws 
in a text which was suppbsisd to be clear and felicitous- -that is, 
to certain information which would hilp the reader to understand 
the consent. This information is especially important for 
younger, less skilled readers with imperfect background 
knowledge. Readability formulas are not sensitive to these text 
factors . 

For both skilled c*nd less skilled reaaers in third grade, 
the revised versions were understood better. The students 
recalled more of the stories and answered more comprehension 
questions. But, as in other studies of this nature, the gains 
were not tremendous, and performance overall was not impressively 
good. The percentage of correct answers to comprehension 
questions was 60% for the original version and 65% for the 
revised version. The ievt 1 of success for less skilled readers 
increased as much as the scores of skilled readers. 

In a related study (Gmanson, Beck, Vdss, 6t Mckebwn, 1984) 
the nature of the form of the stories was not changed but the 
reading lesson was revised, so that questions about segments of 
the stories were made more explicit and more closely related to 
the text being read. The revised reading lesson questions led to 
recall of much more of the central parts of the stories. In the 
unrevised condition, the parts of the stories which 50% or more 
of the children recalled were short, fragmentary and omitted the 
points on which the stories hinged. For the revised questions, 
the parts of the stories recalled included not only the main 
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characters but also more of the sequence of important '^vents. 
Again, increased comprehension is achieved without manipulating 
the text iri ways which would change the readability levels 
assigned to the texts. The increase is significant, although it 
may not seem enormous, and it does not approach perfection. 

Nevisrtheless, these approaches to increasing comprehension 
of a basal reader have a great deal of significance as 
interventions which are totally independent of readability 
formulas. The text elements which are affected are not those 
which could be picked out by a formula, and the changes iade do 
not alter the readability levels which would predict 
comprehension. What is most important, however, is that these 
interventions go directly to the central issues, reading a well- 
formed text and learning to pay attention to information in it. 
It is important to Sake sure that children use their efforts to 
read texts w?iich are not basically ill-formed and flawed, ones 
which have in them what children are learning to pay attention to 
and understand. 

Children's literature . Books published for children to 
read, or to have read to them, outside of school show a greater 
variety of subject matter than reading textbooks do, and a 
greater range of style, text structure and language than the 
selections of reading material in textbooks. The success of a 
'trade' book, as opposed to that of a textbook, depends directly 
on how well it is liked by the children who read it. A tradebook 
will remain in print and continue to be read by large numbers of 
children if it has literary qualities which are perceived and 
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liked by its readers, etiiidren identify with characters who are 
like them in some ways, particularly those who are their age or 
somewhat older. They may also be intrigued by a particular kind 
of story or amused by the imaginative use of characters and the 
expressive qualities of language- -puns, jokes, exaggerations and 
so on. Older children understand generalizations and causal 
relations better than very young school children. None of these 
qualities of a book could be easily measured by a readability 
formula in a way which t^ould distinguish between books which are 
likely to appeal to children of a particular age and those which 
probably will not. 

In deal5.ng with tradebooks for children, the best means of 
matching children of a particular age and reading ability with 
books they will like is not by formula, but by the judgment of a 
person who knows children and books. Although there has been 
mixed success in using people to judge the difficulty of books 
(Klare, 1984), it would seem unlikely that readability formulas 
could do any better, at least with tradebooks. What sets 
tradebooks apart is that they are generally not edited in 
accordance with readability levels, as textbooks generally are. 
This is true also of some very popular children's periodicals on 
science and current events. 

The features of tradebooks which make them popular are those 
which formulas are not sensitive to. In fact, many textbooks 
make use of excerpts from previously successful tradebooks, which 
are often better written than selections created specifically for 
reading textbooks. It is interesting that a current research 
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project on basai readers in primary grades shows that there are 
fewer discontinuities and unclear references to antecedents in 
stories excerpted from tradebooks than in stories written for 
basal readers (L. Meyer, p;c.)- 

Librarians in school and local libraries have direct 
experience with which books get read and by how many children. 
They are also often asked to suggest books to children of 
particular age levels and reading ability, with a certain amount 
of feedback on how well their suggestions were received. There 
are also people who read all the tradebooks published in order 
to review them in publications which advise librarians about 
buying new books. They have some confirmation of their judgment 
of the quality and age level of a book in its subsequent success 
or failure. Librarians and reviewers of children's books have a 
great deal of firsthand contact with a large number of books and 
with successive populations of children. They also have 
continuing feedback, from the children and from sales figures, 
as to how accurate their judgment is. This judgment is based on 
a number of fictors and on sensitivity not only to specific 
features but also to their interactions in a particular book, 
this experience and ability to make judgJients can be used as a 
substitute for formulas, provided one avoids unrealistic 
expectations, estimates are approximate and fall within broad age 
levels (such as grades three to six), and vary also with reading 
ability. Readability formulas are probably not any more 
accurate, given that the reading levels for a particular formula 
may be in error by one or two grade levels. (I am indebted for 
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discussion on these issues to Zena Sutherland, University of 
Chicago Graduate Library School). 

^-^^^•^g^ges other than English . As Klare has noted, there 
have beisn attempts to extend readability forn?ulas to languages 
other than English- -usually European languages whose syntax and 
word structure are not ver/ different. They are languages vjtih 
extensive written literature, both for adults and chi'idren. As 
various countries and language communities within countries 
attempt to find textbook material suitable for different levels 
of schooling and reading ability, it is possible to assess the 
relative merits of the formula style of a%)pioach and the 
alternatives which make use of existing resources. 

Languajge unlike English in st?'ucture^ writing system, etc. 
Although English is one of the two national languages of India, 
there are also a large number of regional languages used in 
different st£.tes. For example, Marathi is the majority language 
of the state of Maharashtra, but a Kannada language, of a 
different family, is used by a substantial minority. Both 
languages, like Marathi and Kannada, are very different in S3nitax 
from English, tvnd hnve much more complex morphology, so that the 
structure of words is quite different from English. There are 
long literary traditions in most of the languages of India, but 
they are primarily concerned with religion and classical themes. 
Much is written in an archaic or literary style far removed from 
the contemporary spoken languages. The writing systems are 
generally based on the syllable, except for Urdu, which is 
written in the Perso-Arabic script which may omit vowels. In 
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either case, it is not cleir what counting 'letters' would mear 
as an index of word complexity. 

A current educational project now going on in India is t-j 
create tests if reading achievement in seven of the regional 
languages. To do this, and to create reading materialr for 
particular grades, it is necessary to have som& idea of which 
texts are generally within the reading ability of children at a 
particular grade level. No official norms currently exist; in 
fact, one of the goals of creating the tests of reading 
achievement is to establish some norms for state educational 
bodies. There were several ways of approachini this task. One 
would have been to take the readability tradition used in the 
United States and apply it to the seven regional languages with 
modifications in the sampling procedure- -counting syllables or 
characters- -and to the approximate grade levels, as jstablished 
by samples of texts read by groups of children. This approach, 
in effect, makes the creation of the means of assessing texts 
into the goal of establishing norms, at great expense of time and 
effort. The alternative which has actually been taken has been 
to find a group of texts known by experience to be appropriate 
for the age level, which is 12 years. These texts were chosen by 
teachers who have had experience with that level of developient 
and school achievement in students. The texts and the questions 
which measure comprehension, are being -ried out on samples of 
students, and those that give the most consistent responses will 
be used in the test of reading achievement. (I am grateful to 
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Dr. R. Shieedliar, Central institute of Indian Languages, Mysore, 
for information and discussion.) 

This approach makes use of information which is already 
available --the experi^^nce of teachers- -and appl7lss it directly to 
creating the test, wi,i.ch is the primal^ goal. As long as ther<5 
is a pool of teachers who teach reading in a particular language, 
it will be possible to draw again upon their judgm jnts to create 
new vers:i,ons of the test. Thir reliance on the judgment of 
experienced and intelligent people has probably saved number of 
years which would otherwise have been spent in recalibrating 
readability formulas. It directly addresses the educational goal 
of finding out the norms for reading achievement. It: appears to 
be a wise use of time, human resources, and money well suited to 
the circumstances. 

Languages without previous traditions of writing. In 
the previous example, the basis for a test of heading ability was 
an educational tradition which already exists for the languages 
in question. School primers and other reading material have been 
in use for a number of years, giving the teachers some firsthand 
knowledge of the problems children have in reading. If there is 
no currently existing stock of texts used for teaching reading, 
it is difficult to know how to create texts for teaching reading 
that presont written language in the right order of increasing 
difficulty. This is the problem faced by the Yupik community of 
Alaska, who want to try to preserve their language (along with 
Engl, h) by teaching their children to learn to read with Yupik 
as the medium. Needless to say, the sentence and word structure 
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of this language are very different from English. Without such 
intervention, the language will soon be lost as children learn 
only English from television and movies, as well as school. In 
thiis situation, it would not be a good method of teaching reading 
to use text materials which are too hard for the children, or 
which are too simple and riot appropriate for older children. 

Instead of trying to adapt English-based readability 
formulas to Yupik, the members of the community have tried to 
draw on their own knowledge and experience as speakers of Yupik. 
One of the approaches being tried out is to study the scylistic 
features of spoken Yupik, to record and analyze how people giv^s 
information, tell stories, arid explain procedures to children. 
These oral texts, and the general features of style, can then be 
transferred to the written medium and tried out on groups of 
children. In this way, texts in Yupik can be created for some 
different age groups, though not necessarily graded into very 
fine grade -level distinctions. (I am indebted to Dr. Anthony 
Woodbury, Department of Linguistics, University of Texas, for 
information and discussion.) 

In the above examples, I have tried to define exactly what 
features of the situation, texts or readers would make the Use of 
readability formulas inappropriate for grading or simplifying 
texts. In place of formulas, a close analysis of the features of 
the text, readers, or situation allowed existing resources to be 
used instead. In some cases, the alternative is deliberately 
chosen over formulas. But in other cases, there really is no 
choice- -formulas could not be used without radical alteration 
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requiring years of research. The results are not known in all 
cases, and when they are known, they may not be startling. All 
that has been shown is that some success can be obtained by 
paying attention to actual readers, texts, and features of 
language. But the question is not whether alternatives are 
significantly more successful than the use of readab:;: lity 
formulas. Sometimes they are, sometimes not. But each attempt 
to deal with non-absti-act properties of texts and readers adds to 
the general sum of knowledge about how language is understood. 
Research On Language Processing 

In the previous section, I have tried to point out that 
there are often cheaper, more direct methods of increasing or 
predicting comprehension of written iateriais which make use of 
already existing resources- -the experience of teachers, as well 
as the knowledge which readers are able to bring to the reading 
of texts. In this section, in contrast, I want to talk about 
some research for which the methods are just beginning to be 
developed. It investigates questions about which very little is 
known by even the most expert investigators. 

1 will contrast the properties of readability formulas, with 
their implied view of language, with some of the properties of 
language for which we are beginning to have firm evidence, even 
if the whole pictitre of how language is processed is still 
incomplete. Readability formulas address the issue of what 
constitutes or reflects complexity of language, or at least this 
issue may be read into them by implication. Whatever one may 
feel Jbout the use of readability formulas as applied to 
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educational or technical materials, the issue of what constitutes 
complexity in language has very great importance in its own 
right. 

Camp lex itv and formulas . Readability formulas typically 
measure average sentence length, in words or syllables, and word 
complexity in syllable length or frequency. As has been pointed 
out innumerable times, these are very superficial linguistic 
measures, and they were designed to be superficial. They are 
superficial because they are easy to define, and are properties 
of all texts. When other measures are added, they also are ones 
which are easily defined and counted, such as pronouns of various 
types. It is often pointed out that these variables are not 
measures of complexity per se . They have some relation to the 
factors which actually cause a text to be complex, so that they 
are really only reflections of the actual causes of complexity. 
On this view, there is some continuity through a text from the 
properties of the most superficial aspects of word choice and 
sentence structure, to syntactic structure and organization of 
conterit of words, to the most abstract level of meaning. 

But there 's not always perfect and continuous correlation 
of text difficulty and linguistic features. The following 
pasisage is difficult to understand: 

Further, the belief about the good that it is good and that 
about the not good that it is not good are alike and so, 
tod, are the belief about the good that it is not good and 
that about the not good that it is good. What belief then 
is contrary to the true belief about the not good that it is 
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not good? Certainly not the one which says that it is 
bad, for this might sometimes be true at the same time, 
while a true belief is never contrary to a true one. 
(Aristotle, De iaterpretatione . J. Ackrill trans, p. 67) 
The length of the sentences alone (3.5 sentences in 100 words) 
would suggest that the passage is not for elementary school 
children, but the words are not technical or difficult in 
themselves, except for contrary . But clearly the meaning of this 
passage is immensely more difficult to grasp than would be 
predicted by the iar*guage it is written in. This is not to say 
that the s5rntax of the passage is simple, or that phrases like 

^ood and the not good are easy to grasp. The meaning is 
independently more complex than the language it is expressed in, 
and so the language does not necessarily reflect semantic 
complexity. 

The predictive power of readability formulas rests on a 
correlation between superficial features and comprehension 
measured in some way. The surface features are not always 
assumed to eaus^ difficulties of comprehension. But there is no 
reason why they should not be sources of difficulty in 
themselves. Unfamiliar words in written form may be hard to 
identify and to relate to the reader's mental lexicon. Long 
sentences may be hard to process simply because there are so many 
parts to be related to one another. Formulas embody the entirely 
plausible notion that if the capacity of a reader to process a 
certain amount of information in a given interval is exceeded, 
comprehension is disrupted. The problem with this, which has 
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never been (explicitly addressed in research on readability 
formulas, is that it is completely vague. We don't know what 
unit complexity is measured in, whether sentence structure and 
word properties arie measured in the same units, what interval of 
time they are contained in, whether this is fixed or flexible, or 
how comprehension is defined. 

These are the issues which I will survey in this section. 
We can assume that meaning and linguistic expression are not 
totally dependent one on the other. The research discussed above 
has shown that complex meaning can be made more understandable by 
changes in surface expression and in the way the text is read. 
It is therefore possible that the language in which some texts 
are written contributes to the complexity of the text, and 
impedes comprehension in some way. 

l£ complexity a fixed value? Certain researchers hsv^ 
recognized that sentence length itself is imperfectly correlated 
with difficult sentence structure. A long sentence can be long 
because it consists of a string of coordinate clauses, which 
present very little problems in processing (1), or because there 
are subordinate clauses, which are more difficult to process. 
But not all subordinate clauses are alike, in that internal and 
left branching clauses (2) are more difficult to process than 
right branching clauses (3): 

(1) A constituent wrote a letter and the letter was informative 

and the congressman quoted him [the constituent]. 

(2) The letter [the constituent [the congressman quoted] wrote] 

was informative. 
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(3) The letter was informative [which the constituent wrote [who 

the congressman quoted]]. 
Researchers such as Botel, Dawkins arid Granowsky (1973) addressed 
this problem directly, doing some research in linguistics and 
computation. They proposed a parsing program which would assign 
weightings to internal or embedded structures, like those in (2) 
and (3), and additional weighting to non-right branching 
structures, as in (3). in this research program, it was hoped 
that it would be possible to measure the syntactic density of the 
sentences in a text at fairly close intervals. Whether such 
structures are actually more complex to understand as a general 
class is an empirical issue (cf. Frazier, 1984, p. 184 for 
evidence which differentiates types of subordinate clauses). 

This approach depends on the very general assumption that 
complexity is a fixed value: If a construction of a particular 
type is relatively more difficult to understand than a 
corresponding but different construction, then the former 
construction is always complex. This assumption has some 
intuitive appeal- -since the linguistic features which make it 
complex persist every time the construction is used. That is, if 
there are perceptual or memory limits which are overloaded when a 
subordinate clause is placed in a particular relation, then this 
overload should occur whenever the construction occurs. it ought 
to be possible to use the weighting program or a more taxonomic 
approach to identify syntactic structures which are complex, 
provided that complexity is a fixed value. A taxonomy can be 
based either on a general characterization of syntax or on 
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rejsearch on the order in which children learn constructions 
(Dawkins, 1975). This assumes that children learning to deal with 
sentence jstructures succeed first with the ^simple and regular 
cases and then with the complex and exceptional cases; 

There is a great deal of truth to these approaches, except 
for the fact that complexity seems not to be a fixed value. 
Complex constructions are not relatively more complex than their 
counterparts, provided that the linguistic context supports the 
complex construction. What this means is that the complexity 
is offset by contextual information which matches the 
conjstruction . 

For example, the research on how children acquire and 
undersi-and language has always indicated that passive sentences 
are more complex than active sentenc-is. There seems to be a very 
plausible explanation for this fact, since passive and other 
cDmplex sentence types do not indicate grami ^tlcal relations of 
subject and object in the normal way (Davison, 1984). The 
sentence object in a passive clause is picked out differently 
than one in an active sentence. If the passive clause is 
preceded by an antecedent for the object , it takes less time to 
understand it than if the sentence is preceded by an antecedent 
for the agent phrase. This finding is also true of other 
construction types which are difficult in isolation. The right 
kind of antecedent in the preceding context shortens the 
processing time, even though more complex syntac ::c structure 
does require more processing time than a less complex structure 
(Davison & Lutz, 1984). 
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A syntactic structure which appears to be very complex is 
the restrictive i: ilative clause (4). it is learned by young 
children later than other ways^ of combining sentences, such as 
coordination (5): 

4) The dog [which ran away from next door] chased our cat. 

5) The dog ran away from next door and it chased our cat. 
In some experiments designed to test comprehension in young 
children (aged three to six), children often seem to interpret a 
sentence like (4), with a restrictive clause, as though it had 
the structure of (5), referring to two separate events ^ both of 
which are asserted by saying (5). Hamburger and Grain (1982) 
have proposed that these results do not accurately reflect what 
young children know about their language. 

First, Grain and others have found that children as young as 
three can pick out the correct meaning of sentences like (4) when 
they are asked to point to pictures instead of making dolls act 
but situations, which is a more complex task. Second, four-year- 
old children were able to produce and understand restrictive 
relative clause constructions correctly when the relative clauses 
were used appropriately, with the right context. The context 
must contain various assumptions- -that the event described by the 
relative clause has already occurred and is known to the speaker 
and hearer^ that there is something which is being described by 
the relative clause, and the information in the relative clause 
helps to pick what that referent is. The clause which ran away 
from nesct door in (4), helps to distinguish a particular dog from 
all the other dogs in the discourse context, and is not used just 
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as a way of describing a dog, as it is in (5). Restrictive 
relative clauses are more complex only if Used in isolation 
without appropriate support- from the situation in which they 
occur. 

This conclusion should have been obvious, since language is 
^or communication. The grammar of a language contains many 
forms for expressing meaning, some more complex than others. The 
more complex forms are not gratuitous, not just ways of 
communicating in more eni-matric and difficult ways, instead, they 
express complex combinations of grammatical, semantic and 
contextual information In very efficient ways. Hence, complexity 
is a feature of syntactic structures, but it is relative and not 
absolute. If complex structures are tested in their appropriate 
environments, they turn out to be less complex than in :'.solation. 
There is some tradeoff between inherent complexity and 
efficiency of communication. 

How is complexity measured in experimental situations? 
Earlier research on iow syntactic structures are comprehended 
gave very discouraging results. There seemed to be no effects, 
or very weak effects, of varying syntactic structures. It 
appeared that syntactic structure did not enter into 
comprehension in any interesting way, even when children were the 
subjects, and if anyone should have problems with understanding 
complex structures, it should be children in the age range before 
grammar is fully learned. But an explanation has emerged in the 
last 10 years or so. 
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The prci'iem is in using memory as a test for the processing : 
of sjrhtactic structures. Memory (recall, recognition) is 
relevant for testing comprehension of info rmatibri- -the content of 
sentences. But as studies like Bransford, Barclay and Franks 
(1972) showed, people have trouble picking out exactly which form 
of a sentence they have previously read. They recognize 
sentences which express the meaning of a sentence or grdUp of 
sentences which were previously read, but are very inaccurate in 
recognizing exactly the sentences which they saw. The 
explanation which has been proposed by many researchers is that 
the surface form of language in a text is not stored in long- term 
memory in verbatim form. Information is stored in some kind of 
interpreted form, in which it can be related to previous 
knowledge or condensed and used as the basis for inferences (see 
Johnson- taird, 1983, for an overview). So all kinds of effects 
of s3mtactic and lexical structure might be found, but not by 
using long- term memory as the measure. 

What should be the way of getting at the effects of 
S3mtactic structure and other surface features of language? 
Language is processed very rapidly. Even when words are repeated 
back as fast as the subject is able to do that, some kind of 
interpretation goes on. Marslen-Wilson (1975) showed that 
subjects can repeat what they have heard within a quarter to a 
third of a second, and in that time are able to correct or 
reinterpret small errors in syntax, semantics or sounds. From 
studies like these, it has been proposed that language processing 
is rapid, which means that not ve*.y much is processed at one 
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time, and it is interactive, which means that many different 
kinds of information are processed together. 

The result has been that research on the effects of 
syntactic structure in sentence processing has begun to measure 
what goes on, while the sentence ts being understood. It appears 
that the kind of memory used in processing language is short-term 
or working memory, which takes small chunks of a sentence to work 
on in short inter\'als, measured in seconds or fractions of a 
second. Subjects are asked *:o respond at certain times by making 
c:oices, or producing a word, or simply indicating that they have 
comprehended a word or a sentence. The time it takes a subject 
to make a response is measured. More complex tasks of 
interpretation are assumed to take more time or be more prone to 
error and reinterpretation. Research which records eye movements 
also provides a very exact measurement of how long it took to 
read sentences with particular structures. For a survey of some 
current research of this kind, see chapters in Dowty, Karttunen, 
and Zwicky (1984). 

A great deal has been learned from experimental studies like 
these, as well as from models of how language should be 
organized, based dri what we know about the features of human 
language and the human cognitive capacity. The picture is far 
from complete, however, and there is no answer as yet to the 
question of what makes a text difficult for a given individual to 
comprehend. These studies do not give information which could be 
substituted tomorrow for a readability formula. But they do shed 
light on an issue which is central to language processing and 
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also to readability formulas. That is the nature of short-term 
or working memory. 

Unfortunately, very little is known about the short-term 
memory capacity of both adults and children, though it is clear 
that when this capacity is exceeded, there are difficulties in 
comprehension. Various factors contribute to overload, including 
syntactic and semantic density at a given interval, but it is 
unclear exactly whac these factors are and how they add up to 
boing too complex. Individuals differ in how efficient they are 
at using short-term memory, and children change and develop in 
how well they can use their choft-term memory capacity (Case, 
Kurland, & Goldberg, 1982). There is also a tradeoff between 
capacity and efficiency; the studies which are surveyed in 
Huggins and Adams (1980) showed that children preferred sentence 
structures which allowed them to process as much information as 
possible up to the limits of their capacity. It it is not clear 
at present what direct implications this research has for the 
questions which readability formulas ought to answer but do not. 
This is a promising area of research, however, in which results 
should yield a miore realistic and useful view of what constitutes 
complexity in language. 

Conclusion 

Recent research on readability, in the narrow sense of 
readability formulas, has concentrated or* statistical refinement, 
computer implementation, and greater ease of application. 
Measurement of text features other than sentence length and word 
complexity has not been explored, and comparatively little 
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isystematic research has been done on how to write texts which are 
within the range of readers at a given level. The progress which 
has been achieved has been in the technical area, not in 
theoretical discussions of what formulas really are 
representations of or why they do or do not work. This being the 
case, it is unlikely that much progress will be made in the near 
future in answering some of the real questions which people want 
answers to- -what makes this text difficult for those students to 
read, how can the text be made better, or what texts features are 
interrelated? 

There is now, as in recent yssars, a certain amount of 
research on readability in the broader sense, which goes directly 
to features of texts and readers in specific situations. But, 
unfortunately these studies are not perceived as a systematic and 
coordinated effort to find an alternative to the formula-like 
approach. Compared with the predictive power of formulas (which 
holds for large aggregates of texts and readers and not for 
smaller groups), the Results of a specific attempt to make a text 
more readable or to match texts and readers may look very small 
and insignificant. Each such study addresses a fairly small 
number of factors, and since there are so many which might 
influence the comprehension of a text or a part of it, the 
results of one study are seldom carried over to further research. 
Yet there will be no greater understanding of what makes a text 
complex if research on alternatives to formulas is allowed to be 
demoralized by the com^^arison of the success in each attempt with 
the overall predictions of formulas. 
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Certainly in the area of research on producing texts, it is 
imperative to understand what goes into the understanding of 
written language, and to have a model bf how comprehension of 
language works. By relating research on readability to research 
in psychology and linguistics on language processing, it is 
possible to make each attempt to go beyond formulas have some 
effect. Let us hope that some of the research being done on 
specific educational and social problems, as well as theoretical 
research on language processing, will eventually provide the 
insight into these questions which has eluded us for so long. 
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